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C<\ • Abstract 

> ■ 

■ In the context of proving lower bounds on proof space in fc-DNF resolution, [Ben-Sasson and Nord- 
PsJ , Strom 2009] introduced the concept of minimally unsatisfiable sets of fc-DNF formulas and proved that a 

minimally unsatisfiable fc-DNF set with m formulas can have at most 0((r7ifc)*''+^) variables. They also 
I gave an example of such sets with n{mk'^) variables. 

■ In this paper we significantly improve the lower bound to f7(m)'^, which almost matches the upper 
bound above. Furthermore, we show that this implies that the analysis of their technique for proving 
time-space separations and trade-offs for fc-DNF resolution is almost tight. This means that although 
it is possible, or even plausible, that stronger results than in [Ben-Sasson and Nordstrom 2009] should 
hold, a fundamentally different approach would be needed to obtain such results. 
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: 1 Introduction 

A formula in conjunctive normal form, or CNF formula, is said to be minimally unsatisfiable if it is un- 
satisfiable but deleting any clause makes the formula satisfiable. A well-known result by Tarsi |AL86|, 
reproven several times by various authors (see, for instance, t BETOTI fCS88l IKulOOl ). states that the number 
of variables in any such CNF formula is always at most (m — 1), where m is the number of clauses. 

Motivated by certain problems in proof complexity related to the space measure in the so-called k-DNF 
resolution proof systems introduced by Krajfcek [KraOl |, Ben-Sasson and Nordstrom |BSN09a| developed 
a generalization of the concept of minimal unsatisfiability to conjunctions of formulas in disjunctive normal 
form where all terms in the disjunctions have size at most k, henceforth k-DNF formulas. We begin by 
reviewing their definition. 
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Assume that B = {-Di, . . . , Dm} is the set of A;-DNF formulas appearing in our conjunction, and that B 
itself is unsatisfiable. What should it mean that B is minimally unsatisfiable? 

The first, naive, attempt at a definition would be to require, by analogy with the k = 1 case, that B 
becomes satisfiable after removing any Di from it. However, the following simple example of two 2-DNF 
formulas 

{(x A yi) V . . . V (x A Vn), (xi A yi) V . . . V (x A (1) 

that is minimally unsatisfiable in this sense shows that we can not hope to get any meaningful analogue of 
Tarsi's lemma under this assumption only. 

The reason for this is that the 2-DNF set Q is not minimally unsatisfiable in the following sense: even 
if we "weaken" a formula in the set (i.e., make it easier to satisfy) by removing any, or even all, y- variables, 
then what remains is still an unsatisfiable set. This leads us to the stronger (and arguably more natural) notion 
that the formula set should be minimally unsatisfiable not only with respect to removing DNF formulas but 
also with respect to shrinking terms (i.e., conjunctions) in these formulas. Fortunately, this also turns out to 
be just the right notion for the proof complexity applications given in IIBSN0 9a1 (for details, we refer either 
to that paper or to Section |4]below). Therefore, following IBSN09al . we say that a set D of /c-DNF formu- 
las is minimally unsatisfiable if weakening any single term (i.e., removing from it any literal) appearing in 
a A;-DNF formula from D will make the "weaker" set of formulas satisfiable. This leads to the following 
question: 

How many variables (as a fiinction of k and m) may appear in a minimally unsatisfiable set 
{Di, . . . , Dm} of k-DNF formulas? 

Tarsi's lemma thus states that for /c = 1 the answer is (m — 1). This result has a relatively elementary 
proof based on Hall's marriage theorem, but its importance to obtaining lower bounds on resolution length 
and space is hard to overemphasize. For instance, the seminal lower bound on refutation length of random 
CNF formulas in fCSSS | makes crucial use of it, as does the proof of the "size- width trade-off" in [BSWOIV 
Examples of applications of this theorem in resolution space lower bounds include LABSRW02. .BSG031 
IB SNOHI lBSN09bl TO)8l INor09al . 

To the best of our knowledge, the case k > 2 had not been studied prior to (BSN09a|. That paper 
established an 0((m/i;)'^+^) upper bound and an ri(mA;^) lower bound on the number of variables. The gap 
is large, and, as one of their open questions, the authors asked to narrow it. 

In this paper, we give an almost complete answer to that question by proving an f](m)'^ lower bound 
on the number of variables. Our construction is given in Section |3l following a little bit of preliminaries 
in Section |2l Then, in Section 0] we discuss certain consequences of our result to proof complexity, the 
bottom line here being that in order to improve on the space complexity bounds from |BSN09a|, a different 
approach would be needed. The paper is concluded with a few remarks and open problems in Section |5l 

2 Preliminaries 

Recall that a DNF formula is a disjunction of terms, or conjunctions, of literals, i.e., unnegated or negated 
variables. If all terms have size at most k, then the formula is referred to as a k-DNF formula (where k 
should be thought of as some arbitrary but fixed constant). 

Definition 2.1 (|BSN09a|). A set of DNF formulas B is minimally unsatisfiable if it is unsatisfiable and 
furthermore, replacing any single term T appearing in a single DNF formula D G B with a proper subterm 
of T makes the resulting set satisfiable. 
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Note that this indeed generalizes the well-known notion of minimally unsatisfiable CNF formulas, where 
a "proper subterm" of a literal is the empty term 1 that is always true and "weakening" a clause hence 
corresponds to removing it from the formula. 

We are interested in bounding the number of variables of a minimally unsatisfiable A;-DNF set in terms 
of the number of formulas in the set. For 1-DNF sets (i.e., CNF formulas), Tarsi's lemma IAL86II states 
that the number of variables must be at most the number of formulas (i.e., clauses) minus one for minimal 
unsatisfiability to hold. This bound is easily seen to be tight by considering the example 

{X1,X2, . . . Xl V X2 V . . . V x„} . (2) 

No such bound holds for general k, however, since there is an easy construction shaving off a factor k^. 
Namely, denoting by Vars (B) the set of variables appearing somewhere in ID, we have the following lemma. 

Lemma 2.2 (|BSN09a|). There are arbitrarily large minimally unsatisfiable sets B of k-DNF formulas with 

I Vars(B)l > A;2(|B| - 1). 

Proof sketch. Consider any minimally unsatisfiable CNF formula consisting of n+1 clauses over n variables 
(for example, the one given in Substitute every variable Xi with 

{x\ A xf A • • • A x'y) V (x^^+i A ^ . . . ^ ^2k^ ^ • • • V (xf ^^=+1 A xf "'^+2 A • • • A xf ) (3) 

and expand every clause to a /c-DNF formula. It is straightforward to verify that the result is a minimally 
unsatisfiable /c-DNF set, and this set has n + 1 formulas over k'^n variables. □ 

There is a big gap between this lower bound on the number of variables (in terms of the number of 
formulas) and the upper bound obtained in |BSN09a|, stated next. 

Theorem 2.3 ( BBSN09al ). Suppose that B is a minimally unsatisfiable k-DNF set containing m formulas. 
Then [Vars(B)| < {kmf^'^. 

A natural problem is to close, or at least narrow, the gap between Lemma IT^ and Theorem 12.31 In this 
work, we do so by substantially improving the bound in Lemma l2!2l 

3 An Improved Lower Bound for Minimally Unsatisfiable Sets 

In this section, we present our construction establishing that the number of variables in a minimally unsatis- 
fiable /c-DNF set can be at least the number of formulas raised to the fcth power. 

Theorem 3.1. There exist arbitrarily large minimally unsatisfiable k-DNF sets B with m formulas over 
more than (^(l — ^))^ variables. 

In particular, for any k > 2 there are minimally unsatisfiable fc-DNF sets with m formulas over (more 
than) [m/sY variables. 

Very loosely, we will use the power afforded by the fc-terms to construct a fc-DNF set B consisting 
of roughly m formulas that encode roughly m^~^ "parallel" instances of the minimally unsatisfiable CNF 
formula in These parallel instances will be indexed by coordinate vectors [x\^,x1^, . . . ,x(^~^J. We 

will add auxiliary formulas enforcing that only one coordinate vector [x\^,xf^, . . . ,x\~^^ can have all 
coordinates true. This vector identifies which instance of the formula ^ we are focusing on, and all other 
parallel instances are falsified by their coordinate vectors not having all coordinates true. 

We now formalize this loose intuition. We first present the auxiliary formulas placing the constraints on 
our coordinate vectors, which are the key to the whole construction. 
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3.1 A Weight Constraint A;-DNF Formula Set 

Let us write x = (xi, . . . ,Xm{k-i)) to denote a vector of variables of dimension m{k — 1). Let |x| = 

Yl^i Xi denote the Hamming weight of x, i.e., the number of ones in it. We want to construct a A;-DNF 
set Wm{x) with 0(m) formulas over xi, . . . ,Xm{k-i) and some auxiliary variables minimally expressing 
that \x\ < 1. That is, a vector x can be extended to a satisfying assignment for Wm{x) if and only if [x| < 1 
but if we weaken any formula in the set, then there are satisfying assignments with |x| > 2. 

We define Wm {x) to be the set of fe-DNF formulas listed next. The intuition for the auxiliary variables 
is that Zj can be set to true only if the first j{k — 1) variables xi, . . . , are all false, and wj can be set 

to true only if at most one of the first j{k — 1) variables xi, . . . , is true. 

zi V (xi A • • • Axfe-i) (4a) 
Z2 V (zi A Xfc A • • • A X2(k-i)) (4b) 



Zm~l V {zm-2 A X(^jn-2){k-l)+l A • • • A a;(m_l)(A;_l) ) (4c) 
k k 

wiVzi^y /\ Xi> (4d) 



i=li'=l 
i'^i 



1/72 V 2;2 V (tui Axfc A • • • j;2(fc-i)) V \/ f A /\ x,;/ j 

4—h ^ ,' — 1. ^ 



2(fe-l) ^ 2(fe-l) 

(4e) 



i=k i'=k 



Wm-1 V Zm-1 V {Wm~2 A X(m-2)(fc-l)+l A • • • A X(m_i)(j!(._i)) 

(m-l)(fe-l) , (m-l)(fc-l) 
V V (2m-2A /\ 



i=(m-2)(fc-l)+l ^ i' = (m-2)(fc-l)+l 

i'j^i 



(4f) 



(Wm-l A X(m-l)(fc-l)+l A • • • A Xm(fc_l)) 

m(A;— 1) , m(fc— 1) 

V V (^m-lA /\ 



X,' 



i=(m-l)(A:-l)+l ^ i'=(m-l)(A:-l)+l 

i'jti 



(4g) 



The set of A;-DNF formulas Wm contains 2m — 1 formulas. Let us see that Wm minimally expresses that x 
has weight at most 1. For ease of notation, we will call the group of variables {x(j„i)(;j_i)_|_i, . . . , Xj(fc_i)} 
the jth block and denote it by Xj. 

Every x with |x| < 1 can be extended to a satisfying assignment for Wm{x). Since all x-variables 
appear only negatively, we can assume without loss of generality that |x| = 1, say all Xj are false except for 
a single variable in the joth block Xj^. We simply set Zj to true for j < Jq and false for j > Jq, and we set 
all Wj to true. 

Every satisfying assignment for Wm{x) satisfies |x| < 1. Assume on the contrary that Xj^ = Xjj = 1; 
ii G Xj-^ , i2 € Xj.-, ; ji < j2- We have that the truth of Xj^ forces zj to false for all j > ji, and then Xi^ = 1 
forces Wj to false for all j > j2- But this means that there is no way to satisfy the final formula (4g i. So for 
all satisfying assignments it must hold that |x| < 1. 

After weakening any term in Wm{x), the resulting set can be satisfied by an assignment giving 
weight at least 2 to x. First we notice that weakening any of the unit terms (i.e., terms of size one) results 
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in removing the formula in question altogether. This can only make it easier to satisfy the whole set than if 
we just shrink a k-term. Hence, without loss of generality we can focus on shrinking the fc-terms. Let us 
consider the formulas in Wm{x) one by one. 

If we remove some literal Xi in d4ab-d4cb. we can set Xi = 1 but still have zi = ■ ■ ■ = Zm-i = 1- This 
will allows us to set also x^(^i^_i-^ = 1 in ( Pg| ) and still satisfy the whole set of formulas although |x| > 2. 

If we instead remove some Zj {j < m — 2) in these formulas, then we can set all Xj = 1 for Xj € Xi U 
. . .U Xj (that already gives us weight > 2) and zi = . . . = Zj = 0, and then we set Zj+i = . . . = = 1 
and Xi = for Xi E -'^j+i . . . U . . . Xm- Note that j < m — 2 implies that Zm-i = 1 which takes care 
of ( |4gl i, and then (I4dl -(l4fli are satisfied simply be setting all Wj to 0. This completes the analysis of the 
formulas d4ali-(l4cb. 

In formula d4dt . if we remove some Xj/ in Ai'=i iY« ^^^^ '^^^ = Xi' = wi = 1 and extend 
this to a satisfying assignment for the rest of the formulas. 

For the corresponding terms zj^i A Ai'=(^-^i)(fc_i)+i i'^ti^j (l4el-(|4g]i, if we remove some Xj/, we 
can again set = Xj' = 1 and set zi = . . . = = 1 and then Wj = . . . = Wm-i = 1 to satisfy the 
rest of the set, whereas removing Zj-i would allow us to assign to 1 all Xj G Xi U . . . U and then still 
assign Wj = . . . = Wm-i = 1- 

For the other kind of terms wj^i A X(^j^ij(^k-i)+i A • • • A Xj(,t_i) in (l?et-(|?g|i, if some Xj with Xj G Xj 
is removed, we can set this Xj to true as well as an arbitrary Xj' G Xi U . . . U whereas removing Wj^i 

would allow as again to set to 1 all variables in Xi U . . . Xj_i. This proves the minimality of Wm{x). 

3.2 The Minimally Unsatisfiable k-DUF Set 

Let us write x^ = (x{, Xg, • • • , ^^^(^k-i))' ^^'^ Wm{x^) be the A;-DNF set with 0{m) formulas constructed 
above (over disjoint sets of variables for distinct j) minimally expressing that \x^ < 1. With this notation, 
let BJ^ be the fc-DNF set consisting of the following formulas: 

Wiix^) l<j<k (5a) 

V (< AxIa---A x1^ a yl,.„...,,_^ l<u<m{k-l) (5b) 



(il,i2,.--,«fc-i)e[m(fc-l)] 



V V (xl A xf^ A • • • A xf^i A J l<u<m{k-l) (5c) 

{jl,i2,...,ifc_i)G[m(fe-l)]'=-i ^ ^ 

ui V n2 V • • • V n„(fe_i). (5d) 

It is worth noting that the range of the index u does not have any impact on the following proof of minimal 
unsatisfiability, and it was set to m(A: — 1) only to get the best numerical results. 

It is easy to verify that consists of less than Amk A:-DNF formulas over more than {m{k — 1))^ = 
(|(4m/c)(l — ^))'^ variables. We claim that BJ^ is minimally unsatisfiable, from which Theorem 13.11 
follows. 

To prove the claim, let us first verify that BJ^ is unsatisfiable. If the CNF formulas Wm (x) in (l5al i are to 
be satisfied for all j < k, then there exists at most one (fc— l)-tuple ■ ■ ■ , G [m{k — 1)]'^"^ such 

that xj* , X?* , . . . , xjr^ are all true. This forces * s to true for all i' to satisfy the formulas in dSbl . 

and then d5cl forces all u,^ to 0, so that (l5dl is falsified. Contradiction. 

Let us now argue that is not only unsatisfiable, but minimally unsatisfiable in the sense of Defini- 
tion l2.1l The proof is by case analysis over the different types of formulas in BJ^. 

1. If we shrink any term in d5at — say, in W,^{x^), then by the minimality property in Section lTTl we can 
set some xj, = xj„ = 1 for i'^ ^ i'/ and then fix some x% = . . . = x^T^ = 1 without violating the 
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remaining clauses in W^{x^), . . . , ^(x'^ This allows us to satisfy the formulas in d5bl and 
d5cl by setting yf., .* .* ^ = 1 and y-j.„ .* .* = for all u, respectively. Finally, set any u, to 
true to satisfy d5db . This satisfies the whole fc-DNF set. 

2. Next, suppose that we shrink some term x}* A x?, A • • • A A yf-* * n in the i^th A;-DNF formula 

*i '■2 *fc-i ''ih'---'*fc-i^ 

in ( BBt . There are two subcases: 



(a) Some x-variable is removed, say, the variable xh . Set x}* = and x% = . . . = rrf, ^ 

*1 *1 *2 *fc-l 

y!^.* * , = 1. This satisfies the z^th formula in d5bt . Then pick some i\ / and set 

V*l'*2'---'*fc-l/ 

X V =1. All this can be done in a way that satisfies all clauses in d5al i since the weight of every 

x^ is one. Set = I and Uj^/ = for all u' ^ u to satisfy ( l5Hb and then y^., „ „ , = to 

satisfy the z/th formula in d5cb (all others are satisfied by literals u^i, u' ^ u). The z^th formula 
in ( l?Et was satisfied above, and for all other u' ^ v vit set yY, * * s = 1 to satisfy the rest 

of the formulas in dSbb . This satisfies the whole A;-DNF set. 

(b) The variable wj".. s is eliminated. If so, set x]* = . . . = x^^^ = 1 to satisfy the vth formula 
in d5bb . Uu = \ and y^„ * = to satisfy d5dt and the z/th formula in d5cl . and Uyi = and 
y^.* „ N = 1 for all z/' 7^ zv to satisfy the rest of the formulas in and ( I5ct . This is easily 
extended to an assignment satisfying ( I5al i as well. 

3. For the z/th formula in d5cb . we may assume, for the same reasons as in Section ITTI that we shrink a 
non-trivial /c-term. Then we again have two subcases, treated similarly. 



(a) Some x-variable is removed, say xL. Set Uy = 1, x]* = Q, x% = . . . = x-, = 1, and 

'1 n '2 *fc-i 

y^., „ ^ = 0. This satisfies (I5dt and the z/th formula in dScl . Setting tZi,/ = for z/' 7^ z/ 
takes care of the rest of d5cl . To satisfy d5bt . we pick some i'^ 7^ and set x^^, = 1, and then set 
y^'-i „ , s = 1 for all u' . All this can be done in a way that satisfies the weight constraints 

V l'^2' ---'^fc-l^ 

in (Bal l. 

(b) The literal yY, , ^ is eliminated. If so, set x]* = . . . = x^~^ = 1 to satisfy the z/th formula 

l*ii---i*fe_iJ *i *fc-i 

in dScb and Uy = Ito satisfy d5db . Setting Uj^/ = for z/' 7^ z/ takes care of the rest of d5cl . Now 

we can satisfy all of ( I5EI by setting y!^., .* \ = 1 for all z/, and it is once again easy to see that 

>.*i '■••'*fe-ii 

the weight constraints in d5al i are also satisfied. 
4. d5db is removed. Set all Uy to 0, and set all y'^^ to 1, then (Bali- dSbb become easy to satisfy. 
This completes the proof that is minimally unsatisfiable as claimed, and Theorem l3 . 1 I hence follows. 

4 Implications for Time-Space Trade-offs for A:-DNF Resolution 

Let us start this section by a quick review of the relevant proof complexity context. The k-DNF resolution 
proof systems were introduced by Kraji'cek F KraOll as an intermediate step between resolution and depth-2 
Frege. Roughly speaking, the fcth member of this family, denoted henceforth by '0\{k), is a system for 
reasoning in terms of /c-DNF formulas. For k = 1, the lines in the proof are hence disjunctions of literals, 
and the system 9^(1) is standard resolution. At the other extreme, 9^(oo) is equivalent to depth-2 Frege. 

Informally, we can think of an lH(fc)-proof as being presented on a blackboard. The allowed derivation 
steps are to write on the board a clause of the CNF formula being refuted, to deduce a new fc-DNF formula 
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from the formulas currently on the board, or to erase formulas from the board. The length of an 5^(fc)-proof 
is the total number of formulas appearing on the board (counted with repetitions) and the (formula ) space is 
the maximal number of formulas simultaneously on the board at any time during the proof. 

A number of works I AB04 . ABE02 Ale05 JN02 Raz03 $BI04 Seg05J have shown superpolynomial 



lower bounds on the length of fe-DNF refutations. It has also been established in IISBI04I Seg05| that the 
9^(A;)-family forms a strict hierarchy with respect to proof length. Just as in the case for standard resolution, 
however, our understanding of space complexity in fc-DNF resolution has remained more limited. Esteban 
et al. I EGM04I established essentially optimal space lower bounds for 9\{k) and also proved that the family 
of tree-like 1H(A:) systems form a strict hierarchy with respect to space. They showed that there are formulas 
Fn of size n that can be refuted in tree-like {k + 1)-DNF resolution in constant space but require space 
Q(n/ log^ n) to be refuted in tree-like /c-DNF resolution. It should be pointed out, however, that tree-like 
9^(A;) for any A; > 1 is strictly weaker than standard resolution, so the results in 1 1 Xi. VI 04 1 left open the 
question of whether there is a strict space hierarchy for (non-tree-Uke) fc-DNF resolution or not. 

Recently, the first author in joint work with Ben-Sasson IIBSN09al proved that Krajicek's family of 9^(A;) 
systems do indeed form a strict hierarchy with respect to space. However, the parameters of the separation 
were much worse than for the tree-like systems in IEGM04.I . namely that the 9^(A; + l)-proofs have constant 
space but any 9^(A;)-proof requires space r2( '°^\/ n / log n) . It is not clear that there has to be a (fc + l)st 
root in this bound. No matching upper bounds are known, and indeed for the special case of 91(2) versus 
1H(1) the lower bound is Q.{n/\og n) by IBSN09b 1. i.e., without a square root. Also, combining f BSN09all 
with results in LBSN09 b | one can derive strong length-space trade-offs for /c-DNF resolution, but again a 
{k + l)st root is lost in the analysis compared to the corresponding results for standard resolution iH(l). 

Returning now to the minimally unsatisfiable /c-DNF sets, the reason for studying this concept in 
IBSN 09a1 was that is was an interesting special case of a more general problem arising in their proof anal- 
ysis, and that is was hoped that better upper bounds for this special case would translate into improvements 
for the general case. Although there appears to be no such obvious translation of lower bounds from the 
special to the general case, by using the ideas from the previous section we can show that the analysis of 
the particular proof technique employed in fBSN09al is almost tight. Thus, any further substantial improve- 
ments of the bounds in that paper would have to be obtained by other methods. 

We do not go into details of the proof construction in |BSN09a| here, since it is rather elaborate. Suffice 
it to say that the final step of the proof boils down to studying A;-DNF sets that imply Boolean functions 
with a particular structure, and proving lower bounds on the size of such DNF sets in terms of the number of 
variables in these Boolean functions. Having come that far in the construction, all that remains is a purely 
combinatorial problem, and no reference to space proof complexity or A;-DNF resolution is needed. 

For concreteness, below we restrict our attention to the case where the Boolean functions are exclusive 
or. More general functions can be considered, and have been studied in | B SN09a' 'B SN09b | , and everything 
that will be said below applies to such Boolean functions with appropriate (and simple) modifications. 
Hence, from now on let us focus on DNF sets that minimally imply a particular kind of formulas that we 
will refer to as (^AVQ)'') -block formulas. A (aV©'') -block formula is a CNF formula in which every variable 
X is replaced by 0(Li where xi, . . . , are new variables not appearing in the original formula. Thus, 
literals turn into unnegated or negated XORs, every XOR applies to exactly one "block" of k variables, and 
no XOR mixes variables from different blocks. Let us write this down as a formal definition. 

Definition 4.1. A i^/W/®^^ -block formula G is a conjunction of disjunctions of negated or unnegated exclu- 
sive ors. The variables of G are divided into disjoint blocks xi, . . . , Xk, yi, . . . ,yk, zi, . . . , Zk et cetera, of 
k variables each, and every XOR or negated XOR is over one full block of variables. 

The key behind the lower bounds on space in |'BSN09a1 is the result that if a A;-DNF set D implies a 
(AV©'^^^) -block formula G with many variables, then D must also be large. 
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Theorem 4.2 (|BSN09a|). Let k be some fixed but arbitrary positive integer Suppose that D is a k-DNF 
set and that G is a i^f^®^^"^^ -block formula such that B implies G, and fiirthermore that G is minimal in 
the sense that if we remove a single XOR or negated XORfrom G (thus making the formula stronger), it no 
longer holds that D implies G. Then \ Vars{G)\ = 0(|Dl'=+^). 

Using this theorem, one can get the n / log n space separation mentioned above between /^-DNF 
resolution and (/c+l)-DNF resolution. Any improvement in the exponent in the bound in Theorem 14.21 
would immediately translate into an improved space separation, and would also improve the time-space 
trade-offs one can get when transferring the results in |BSN09b| from resolution to A;-DNF resolution. 

Prior to the current paper, the best lower bound giving limits on what one could hope to achieve in 
Theorem lO was linear, i.e., | Vars{G)\ = Namely, let G be a conjunction of XORs {^^^i Xi) A 

(0jta yi) ^ i®i=i and let ID) be the union of the expansions of every 0^^^ Xi as a CNF formula. 

For this particular structure of G it is also easy to prove that | Vars{G)\ = 0(|B|) for any choice of D, but 
it has been an open question what happens when we consider general formulas G. 

For k = 1, IIBSN09 bl proved that a linear bound 0(|][I)|) in fact holds for any set of clauses B and any 
(aV©^) -block formula G, but all attempts to extend the techniques used there to the case k > 1 have failed. 
And indeed, they have failed for a good reason, since building on the construction in Section|3]we can show 
that this failure is due to the fact that the best one can hope for in Theorem 14. 21 is | Vars{G) \ = 0[\I1)\^). 

Theorem 4.3. For any k > 1 there are arbirarily large k-DNF sets B of size [B| = m and [a\/(B^~^^) -block 
formulas G such that B implies G, this implication is "precise " in the sense that if we remove a single 
XOR or negated XORfrom G it no longer holds that B implies the strengthened formula, and \ Vars{G) | > 

Proof. We utilize all the previous notation and start with the CNF formula 

ue[m{k-l)] (n,..,ife_i)e[m(A:-l)]'=-l 

and substitute an exclusive or over variables y'^^^ ^, r = 1, . . . , + 1, for every variable y^''^ ^. This 
results in the formula 

ue[m(k-l)] (ii,...,ifc„i)e[m(A:-l)]'=-i r=l 

which will be our (aV©'^'*"^) -block formula. Clearly, G contains (A; + 1) • {m{k — 1))'^ variables. We claim 
that the following easy modification of the fc-DNF set from Section "precisely" implies G in the sense 
of Theorem l4.3l 



W^{x^) l<j<k (8a) 

V (4 A---Axf-; Ay^i.,,.,) l<u<m{k-l) (8b) 
(ii,...,ifc_i)eH''"^ 

V (4 ^ • • • ^ ^ti ^ C...,*fe-i) l<u<mik-l),2<r<k + l (8c) 

(n,.--,«fc-i)G[m]''— 1 

It is straightforward to verify that B consists of less than m{k — l)(fc + 1) + 2mk < mk{k + 2) fc-DNF 
formulas. B implies G since once we have picked which variables xj, ,x%,..., xf."^ should be satisfied, 
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5 CONCL UDING REMARKS AND OPEN PROBLEMS 



D will force all XOR blocks 0^+| y^f , j e [m{k - 1)] to true by requiring the variable y.* •* to 
be true and all other variables y^S .« , r > 2, to be false. Finally, it is also easy to verify that D implies 

1 ■ ■' k — l 

G "precisely" in the sense that if a single XOR block 0^^} y^f ^* is removed from G, then we can 
satisfy D but falsify the rest of the formula G (the proof is very similar to the one given in Section IT2l . 
Theorem l4.3l follows. □ 



5 Concluding Remarks and Open Problems 

We conclude this paper by discussing two remaining open problems. 

Firstly, the most obvious problem still open is to close the gap between $7(m)^ and 0((mA;)^+^) for 
the number of variables that can appear in a minimally unsatisfiable fc-DNF set with m formulas. There 
is a strongly expressed intuition in [BSN09al that it should be possible to bring down the exponent from 
k + I to k. Hence we have the following conjecture, where for simplicity we fix k to remove it from the 
asymptotic notation. 

Conjecture 1. Suppose that B is a minimally unsatisfiable k-DNF set for some arbitrary but fixed positive 
integer k. Then the number of variables in D is at most 0(|D|)'^. 

Proving this conjecture would establish asymptotically tight bounds for minimally unsatisfiable A;-DNF 
sets (ignoring factors involving the constant k). 

Secondly, we again stress that the result in Theorem 14.31 does not per se imply any restrictions (that 
we are aware of) on what space separations or time-space trade-offs are possible for A;-DNF resolution. 
The reason for this is that our improved lower bound only rules out a particular approach for proving 
better separations and trade-offs, but it does not say anything to the effect that the /c-DNF resolution proof 
systems are strong enough to match this lower bound. It would be very interesting to understand better the 
strength of fc-DNF resolution in this respect. Hence we have the following open problem (where we refer to 
|[BSN09b| or |Nor09b| for the relevant formal definitions). 

Open Problem 2. Let Peb^^ [©] be the XOR-pebbling contradiction over some directed acyclic graph G. 
Is it possible that k-DNF resolution can refute Peb^^ [©] in space asymptotically better than the black- 
white pebbling price BW-Peb{G) ofG? 

We remark that for standard resolution, i.e., 1-DNF resolution, the answer to this question is that XOR- 
pebbling contradictions over two or more variables cannot be refuted in space less than the black-white 
pebbling price, as proven in |BSN09b |. For fc-DNF resolution with k > \, however, the best known lower 
bound is 0( ''+^ BW-Peb{G)) , as shown in |BSN09al. There is a wide gap here between the upper and 
lower bounds since, as far as we are aware, there are no known fc-DNF resolution proofs that can do better 
than space linear in the (black) pebbling price (which is achievable by standard resolution). 
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